Draft version April 14, 2008 

Preprint typeset using 1^1^^ style cmulatcapj v. 03/07/07 



00 

O 
O 
(N 

< 



6 



> 

in 

(N 

^' 

O 
oo 
O 



X 



PC 1643+4631A,B: THE LYMAN-a FOREST AT THE EDGE OF COHERENCE 
C. M. Casey^-^ C. D. Impey\ C. E. Petry\ A. R. Marble\ R. Dave^ 

Draft version April 14, 2008 

ABSTRACT 

This is the first measurement and detection of coherence in the intergalactic medium (IGM) at 
substantially high redshift (z~3.8) and on large physical scales (~2.5 h^^ Mpc). We perform the 
measurement by presenting new observations of the high redshift quasar pair PC 1643+4631A, B 
and their Lya absorber coincidences. With data collected from Keck I Low Resolution Imaging 
Spectrometer (LRIS) in a 10,200 sec integration, we have full coverage of the Lya forest over the 
redshift range 2.6 < z < 3.8 at a resolution of 3.6A (~ 220 km s^^). This experiment extends multiple 
sightline quasar absorber studies to higher redshift, higher opacity, larger transverse separation, and 
into a regime where coherence across the IGM becomes weak and difficult to detect. Noteworthy 
features from these spectra are the strong Damped Lya Absorbers (DLAs) just blueward of both 
Lya emission peaks, each within 1000 km s^^ of the emission redshift but separated by 2500 km s^^ 
from each other. The coherence is measured by fitting discrete Lya absorbers and by using pixel flux 
statistics. The former technique results in 222 Lya absorbers in the A sightline and 211 in B. Relative 
to a Monte Carlo pairing test (using symmetric, nearest neighbor matching) the data exhibit a 4(t 
excess of pairs at low velocity splitting (Au < 150 km s^^), thus detecting coherence on transverse 
scales of ~2.5 h^^ Mpc. We use spectra extracted from an SPH simulation to analyze symmetric pair 
matching, transmission distributions as a function of redshift and compute zero-lag cross-correlations 
to compare with the quasar pair data. The simulations agree with the data with the same strength 
(~4ct) at similarly low velocity splitting above random chance pairings. In cross-correlation tests, the 
simulati ons agree when t he me an flux (as a function of redshift) is assumed to follow the prescription 
given by lKirkman et al.l (|2005| ). While the detection of flux correlation (measured through coincident 
absorbers and cross-correlation amplitude) is only marginally significant, the agreement between data 
and simulations is encouraging for future work in which even better quality data will provide the best 
insight into the overarching structure of the IGM and its understanding as shown by SPH simulations. 
Subject headings: quasars: absorption lines — intergalactic medium — cosmology: observations — 
quasars: individual (PC 1643-h4631A, B) 



1. INTRODUCTION 

Quasars have become vital tools for understanding the 
nature of the intergalactic medium (IGM) over most of 
the Hubble time. Blueward of Lya emission, the ab- 
sorption in the Lya forest traces the one-dimensional 
distribution of the neutral component of the interven- 
ing material— the intergalactic medium— along the line 
of sight. Si n ce th e first analyses of the forest by 
ISargent et al.l ()1980D . astronomers have used Lya ab- 
sorption spectra to characterize the physical state of the 
IGM at various epochs. The IGM is highly ionized and 
absorption of neutral hydrogen ( H I) scales as a power 
law according to the gas density (|Rauch|[l998D . At the 
lowest redshifts, this so-called Gunn-Peterson approxi- 
mation breaks down since a non-negligible amount of the 
absorbing hydrogen has collapsed into dense structures 
or warm narrow-line Lya absorbing gas. 

Over the past twenty years, several studies have shown 
that Lya absorbers are better tracers of dark matter 
potential wells and baryon density than galaxies, sup- 
porting expectations fro m theory and consistent with 
gas dynamic simulations (ICen et all 119941: IZhang et al.l 
I1995L iHernauist et"aI1 119961 : iDave et al.l Il999ll The 
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HST Quasar Absorption Line Key Project (iBahcall et al] 
119961 : iJannuzi et all 119981 : IWevmann et al.l Il998h used 
many single sightlines with z < 1.6 to probe the com- 
plex geometry of the evolving "Cosmic Web" through its 
voids and dense regions. The role of hydrodynamic simu- 
lations becomes important when considering bulk prop- 
erties of the I GM as inferred by Lya flux decrements 
(|Marble et al.|[200 7): the data from the Lya forest show 
the characteristics of the universe in long but widely- 
separated redshift paths while simulations perform the 
forward experiment, modeling the evolution of the uni- 
verse in finite volumes that are limited by number of 
particles and resolution. 

Multiple sightline experiments, ori ginally presented us 
ing gravita tionally lensed quasars (* Shaver et al.l 
Shaver fc R obertso n! Il983t Wev msnin & Fofe 



1982 



1983 



Foltz et al.lll984t iSmette et aLfTl992l 11995 ). are used to 



infer the IGM's structure in two dimensions through 
cross-correlation of the spectra giving characteristic co- 
herence lengths. With pairs (or groups) of different 
transverse separations and different redshifts, the evo- 
lution in the typical structures of the IGM can be mea- 
sured. The measurement of coherence is based on the sta- 
tistical excess of matched, coincident absorber features 
across the lines of sight. 

The motivation for extending coherence measurements 
to higher redshift and larger transverse separation, where 
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the signal is anticipated to be weak, is that it repre- 
sents a new regime for testing the gravitational insta- 
bility paradigm that underlies the description of large 
scale structure on scales larger than galaxies. One study 
yielded a marginal detection of line correlations in a grid 
of 10 quasars on tran sverse scales of ^ lOh^Q at z ^ 2.5 
(jWiinger et al.ll2000[ ). but the authors did not compare 
the signal to theorectical expectations. The same group 
subsequently used moments of the tran smission probabil- 
ity density to measure the coherence (|Liske et al.ll200dl) 
and saw diagreement with th e hydrodynamic simula- 
tions of ICen fc Simco j (|1997t l. although they did not 
make direct comparisons as in this paper. Other mo- 
tivations for this type of study include the possibility 
of discovering non-Gaussian struc tures like voids , as ob - 
served on these scales at z ~ 2 by iRoUinde et al.l (|2003f ) , 
or testing for non-gra vitational effects, as anticipated by 
iFang fc Whiti (|200l . 

Line of sight correlations are washed out on distance 
scales less than the the redshift resolution, and the trans- 
mission distribution on velocity scales of less than sev- 
eral hundred kms~^ is modulated by peculiar velocities. 
Transverse correlations can be effectively measured given 
a mean SNR > 10, which allows peaks in the opacity 
distribution to be located in velocity reliably and with a 
precision better than instrumental resolution. Suitable 
interpretation of course depends on treating data and 
hydrodynamic simulations identically for the coherence 
measurement. 

These observations of the high redshift quasar pair PC 
1643-I-4631A, B focus on the redshift range from z — 2.6 
(Lyman limit cutoff at 4377A) to Lya emission at Zem = 
3.8. With better resolution and higher signal-to-noise 
than previous William Hershel Telescope (WHT) spec- 
tra analyzed bv I Saunders et all (|1997f ). these dat a allow 
a detailed study of the Lya absorbers. fSaundcr s et al.l 
((l997) had 12A instrumental resolution and a 1500 sec 
integration yielding poor blue data sufficient only to de- 
termine if the pair was a lens, since there had been ear- 
lier claims of a str ong CMB decrement towards the pair 
pones et al.lll997l) . They did not present analysis of Lya 
absorber coincidences. At redshift 3.790, quasar A has 
magnitude Ba = 20.0 ± 0.3 and quasar B, at redshift 
3.831, has magnitude Bb = 20.7 ± 0.3; the two quasars 
have an angular separation of 198". The spectra and 
redshifts are suflicently different that there is no doubt 
that the quasars form a physical pair rather than a lens 
system. 

The best basis for estimating the requirement for a 
definitive detection of co herence on 3 Mp c scales 
comes from the work of iRollinde et al.l (l2003( l. They 
observed five pairs and a quad spanning angular scales 
that encompass the anglular separation of the pair in 
this paper, and their VLT data has almost identical res- 
olution. Their N-body simulations indicate that a mean 
SNR ^ 50 yields a la detection of flux correlation for 
a separation of 200 arcseconds, so assuming Gaussian 
noise, a 3ct detection would take 10 pairs with this quality 
of data or a single pair with SNR 150. In this exper- 
iment we explore two metho ds of measuring coherence; 
the first fits the predictions of lRoUinde et al.l ([2003) since 
measuring coherence is difficult with poor SNR~30, and 
the second is matching Lya absorbers across the sight- 
lines using nearest neighbor matching. 



In ^ we discuss the observations and data reduction, 
and |J3] introduces the mechanics of obtaining spectra 
from simulations. The use of automated software, named 
ANIM ALS, to fit lines and continua to spectra and sim- 
ulations is described in ||4l Physical characteristics of the 
spectra are given in ^ while the absorber characteristics 
are given in ^Sl The coincidences between sightlines are 
described in SjT] We compare simulation extractions to 
the data in fJH both in terms of individual sightlines and 
their transmission properties, and jointly to understand 
coherence in the transverse dimension. 

2. OBSERVATIONS AND DATA REDUCTION 
2.1. Processing the LRIS Spectra 

All data for this project were obtained with the Keck 
I Low Resolution Imaging Spectrometer (LRIS) on 21 
July 2001 and were reduced using the standard IRAF 
longslit-spectra pipeline. To cover the wavelength range 
of scientific interest, only the red side of the spectrograph 
and the 900/5500 grating was used. The red CCD has 
two amplifiers; at the time of the observations the left 
side had a gain of 1.97 e~/ADU and read noise 6.3 e~, 
and the right had gain 2.10 e^/ADU and read noise 6.6 
e~ . The LRIS red CCD also has unexposed rows due to 
the sub- array readout, but these presented no practical 
problem in the data analysis. Since both quasars were 
placed on the long slit simultaneously, all images were ex- 
amined to make sure no alignment shift occurred during 
the observations. The target was continuously tracked 
and the slit was aligned close enough to the parallactic 
angle to avoid significant loss of blue light. Lyman limit 
absorption means there is little useful data below 4400A. 
These July 2001 data had an effective integration time 
of 10,200 seconds. A previous dataset from 2 June 2001 
was of too poor quality to be useful. 

While most data reduction steps were performed 
with the standard IRAF packages, we used two tasks 
[Iccdproc and Irisbias) that are specific to LRIS. The 
column bias effects from the overscan region were re- 
moved from all science frames using Iccdproc with gain 
of 1.97 and read noise 6.3 e~ entered as parameters for 
the left channel. The images were then trimmed to the 
appropriate size and zerocombine was used to combine 
the bias frames. The biases were averaged together to re- 
duce cosmic rays with a sigma-clipping algorithm. With 
Iccdproc, this averaged bias frame was then subtracted 
from the remaining images. 

To remove pixel-to-pixel variations and optical vi- 
gnetting, we created a flat field image using flatcombine. 
The only complication was a gain discontinuity between 
right and left amplifiers. The discontinuity is not no- 
ticeable in the low exposure frames (the science images) , 
but it was removed by dividing each side of the combined 
flat by its respective gain. Once this correction had been 
made, we used response to fit a continuous spectral re- 
sponse function to the flat frame. After the resultant flat 
field was normalized, the remaining images (flux calibra- 
tion and science) were divided by the flat fleld, removing 
pixel-to- pixel variations down to a level of 3%. 

2.2. Spectral Extraction 

For cosmic ray rejection, we combined five individual 
1800 second exposures and one 1200 second exposure 
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in 2D before extraction. To line up the quasar spec- 
tra, each image was shifted in x and y coordinates using 
imshift. The images were then averaged together us- 
ing imcombine and its 'cosmic ray rejection' algorithm. 
Prior to extracting the spectra using apall (part of the 
package AP EXTRACT), suitable regions for sky sub- 
traction were chosen by visual inspection. There was 
a star close to quasar B that had to be avoided. For 
quasars A and B, the trace had a FWHM of 6-7 pixels, 
indicating seeing of 0.6" during the observations. Even 
after sky subtraction, the strong sky line at 5577A shows 
up as a residual artifact in the final spectra. Apall was 
used to create a weighted variance spectrum array, an 
unweighted spectrum array, a background spectrum ar- 
ray, and an error array for each quasar. A second order 
trace function was used to linearize the spectra. 

2.3. Wavelength and Flux Calibration 

For wavelength calibration, the lamp spectra were ex- 
tracted using the same trace functions given by the 
quasar pair A and B. HgNeArCdZn and CdZn lamps 
were taken before and after the quasar exposures. The 
I RAF tasks identify and reidentify were used to itera- 
tively fit lines, with solutions for all permutations of the 
two lamps and the two target apertures inspected. The 
final wavelength solution applied to A and B was based 
on the HgNeArCdZn lamp since it had a more uniform 
distribution of lines; after excluding one line from the 
fit, the final RMS wavelength error was 0.054A, which is 
~2% of a resolution element. The wavelength solution 
was applied using the I RAF tasks re f spec then dispcor 
sequentially. The sampling of the final reduced spectra 
was 0.85A per pixel. 

Flux calibration is not necessary to achieve the scien- 
tific goals of the experiment, but it is useful for seeing the 
true continuum shape. To calibrate the flux, we divided 
each amplifier by its gain (to eliminate the right/left 
channel discrepancy) and remultiplied by 1.97 (the left 
channel gain). This preserves the normalization of the 
counts, but eliminates the discontinuity of the spectrum. 
We ran apall to extract a spectrum for the standard star, 
BD 284211, then used standard and sensfunc to fit the 
sensitivity function of the LRIS instrument. The flux cal- 
ibration was applied using calibrate and the final spectra 
are shown (with 1.5 pixel boxcar smoothing) in Figure [TJ 
The fiux and error arrays are available in the electronic 
version of the article. 

2.4. Instrumental Resolution and Signal to Noise 

To calculate a representative value for instrumental 
resolution, we used the average FWHM from our ob- 
servation run. Blended lines, saturated lines, peripheral 
lines widened by defocusing, and low intensity lines were 
excluded from the resolution calculation, and only the 
FWHM of lamp lines in the Lya forest wavelength range 
were used. Averaging the FWHM of 11 features with 
characteristic Gaussian line profiles gives an instrumen- 
tal resolution of 3.6 ± 0.3 A. At 5000 A, this corresponds 
to -220 km s-^. 

To calculate signal to noise (SNR) for each quasar, we 
used the output fiux and error arrays from IRAF. First, 
we divided the signal by the error for each wavelength bin 
and averaged the SNR value over the Lya forest range. 
This method gives the SNR of quasar A as 32.2 ± 5.7 



and 16.8 ± 4.3 for quasar B. A second method averages 
the signal over the Lya region (fairly constant), averages 
the flux error similarly, and divides the two values. This 
gives the SNR for A as 31.3 ± 7.4 and SNR for B as 
16.6 ± 6.4. Since the methods agree well, we round the 
SNR values to 32 for quasar A and 16 for quasar B in 
the remainder of this paper. 

3. SPECTRA DRAWN FROM SIMULATIONS 

We will compare our cross-correlation measurements 
with simulations, to see if current structure formation 
models can reproduce observations. For this purpose, 
we considered pairs of synthetic Lya absorption spec- 
tra extracted from the smoothed particle hydrodynamic 
(SPH) cosmological simulation G6 (described below; 
ISpring el fc HernquistI 120031 : iFinlator et all l2006( l. We 
extracted 1,000 paired fines of sight (separated by 198 
arcseconds, converted to physical transverse distance as 
a function of simulation redshift) randomly distributed 
across each of the three orthogonal faces of the simu- 
lation box for four different epochs (z = 2.5, 3.0, 3.5, 
and 4.0) giving a total of 24,000 individual spectra. The 
physcial transverse separations of sightlines at these four 
epochs are 2.16, 2.54, 2.61, and 2.66 h^Q Mpc respec- 
tively. These extractions were originally made for a study 
of the Alcock-Paczynski eff ect in the Lya forest, and are 
described in more detail in iMarble et al.l ()2007t ) . 

The 06 simulation is an extension of the G-series 
(|Springel fc Hernauistl[2003l ) . which was run with a mod- 
ified versi on of the N-bo dy+SPH galaxy formation code 
Gadget (jSpringel et al.. ,2001). The following cosmolog- 
ical parameters were employed: ftm — 0.3, = 0.7, 
f2& = 0.04, as = 0.9, and h = 0.7, consistent with first- 
year W MAP data as well a s Lya forest power spectrum 
results (jSpergel et al.|[2003l ). The relatively large volume 
of G6 (100 Mpc comoving on a side) is necessary 
to accurately model matter correlations on the scale of 
few Mpc. With 486^ dark matter and 486'^ gas particles, 
it is one of the largest cosmological hydroydnamic sim- 
ulations ever done, and its large dynamic range is criti- 
cal for resolving the small-scale structure associated with 
the Lya forest. The corresponding gas mass resolution 
of 9.79 X 10"^ h~^MQ is not ideal for detailed study of the 
Lya forest as it does no t quite resolve the Jeans mass in 
the IGM (|Schavell200lh . but given that the observations 
considered here have a spectral resolution of 220 km/s 
(i.e. a distance of about 0.7 Mpc/h in Hubble fiow), the 
mean interparticle spacing of 0.2 Mpc/h is sufficient to 
resolve fluctuations at the level probed by the data. 

Radiative heating and cooling were calculated assum- 
ing photoionization equilibrium and optically thin gas. 
Additional prescriptions for star formation and super- 
nova feedback were incorporated, though the impact of 
these subgrid prescriptions on the Lya forest is min- 
imal. Additionally, G6 included galactic outflows via 
a Monte Carlo ejection of gas from star-forming re- 
gions, where the outflow speed is taken to be 484 km/s 
and the mass loading factor (i.e. the mass outflow 
rate relative to the star formation rate) is taken to be 
2. Recent work has shown that this particular out- 
flow model is not as successful as one where the out- 
flow velocity scales with the characteristic velocity of the 
galaxy (i.e. the "momentu m-driven wind" scalings of 
iQppenheimer fc Pavel [2006f) . For instance, momentum- 
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Fig. 1.— The spectra of the quasars PC 1643+4631A (top) and 
and the mean signal to noise ratios are 32 and 16 respectively. The 
yielding the full coverage of the hya forest, but little data redward 

driven wind scalings better repr oduce IGM enrich- 
ment (jOppenheimer fc Pavel I2006D. the g alaxy mass- 
met allicityredat!ioii|Rnl^^ and obser- 



B (bottom). The quasars have magnitudes Ba = 20.0 and Bb = 20.7, 
red end of the LRIS spectrograph covers wavelengths 4200A to 6IOOA, 
of hya emission. 



vatio ns of high-redshift galaxies (iDave fc Op penheirnen 
among other things. However, [Marble ct al] 
demonstrated that the outflow prescription has 
only a very minor impact on correlations in the Lya for- 
est; in particular the difference in Lya forest correlation 
between using the G6 wind model and a momentum- 
driven wind model is < 5%. This is primarily because 
outflo ws enrich a r elatively small volume of the Uni- 
verse (jOppenheimer fc Dave..2006.) , so do not impact the 
bulk of the volume traced by Lya absorbers. 

Finally, a spatially uniform photoionization back- 
ground was included, wi th the spectr al sha pe an d red- 
shift evolution given bv iHaardt fcMadaul (|i996l ). For 
moderate ovcrdensities characteristic of the Lya for- 
est {p/{p) < 10), the amphtude of this background 
can be effectively and subsequently c hanged by rescal - 
ing the resulting opacity distribution (jCroft et al.lll998f) 
to match the desired mean transmitted flux, (/), of 
the Lya forest. However, (/) remains an observation- 
ally uncertain quantity. Measurements of (/) made di- 
rectly from high-resolution echelle spe ctroscopy (e.g., 
iKim. Cristiani. fc D'Odoricol (|2001[ ) and iKirkman et all 
( 20051 )) have vielded consistentlv higher values than 
those based on extrapolation from redward of the 
Lya forest in much larger samples of low e r-reso lution 
spectra (e.g., [Press. Rvbicki. fc Schneideil ([1993D and 



iBernardi et al.l (|2003[ )). with relative differences of 5% — 
25% at 2.5 < z < 4.0. For this reason, two sets of simu- 
lated spectra were produce d, corresponding to the mean 
flux prescr iptions of both Press. Rvbicki. fc Schneide"rl 
([1993) and IKirkman et all ([20051 ). hereafter referred to 
as P93 and K05. 

4. ANIMALS PROCESSING 

To detect, fit and analyze quasar absorption lines, 
we used a custom made software package: ANalysis 
of the Intergalactic Medium via Absorption Lines Soft- 
ware (ANIMALS), originally based on a kernel de- 
veloped by Tom Aldcroft with substantial modifica- 
tions by Cathy Petry at Steward Observatory. The 
software is faithful to the methodology of the HST 
Quasar Ab sorption Lin e Kev Project jBahcall et ahl 
1 199611 Jannu zi ct al. 1998 HWevmann et al.lll998f ). and its 
characteristics are described at length in the original 
method paper ([Petrv et al.lll998fl . While it was originally 
designed to handle typical observed quasar spectra, it has 
since been expanded to accomodate spectral extractions 
from SPH simulations (which have much shorter spectral 
coverage but higher resolution than is typical of real ob- 
serva tions) and a.utoma te the process of continuum fit- 
ting ([Petrv et al.ll2002l ). This expansion also included 
improvements to handle data with a wide range of SNR 
and line density— particularly useful when analyzing high 
redshift quasar Lya forest li nes. For a more co mplete 
discussion of the software, see [Petrv et al.[ ([2006[) . 

We use ANIMALS in three contexts in this paper— to 
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fit continua to the data, to fit absorption lines and mea- 
sure their significances and equivalent widths (EW) , and 
to degrade spectra of SPH extractions to the instru- 
mental resolution and signal-to-noise of our observations. 
These processes are described below, and their purposes 
become apparent later in the paper. 

4.1. Continuum Fitting 

The continua for both PC 1643-I-4631A and B were 
fit by automatically computing the average flux value 
m 50A bins using ANIMALS's simple cubic spline fit- 
ting routine. If points on the spectrum deviated by more 
than 2a from the averaged flux values (the "continuum" ) 
in the negative direction, they were flagged as poten- 
tially absorbed pixels and rejected from the averages that 
tether subsequent iterations of the continuum fit. The 
procedure generally converged after four iterations, after 
which the spline fit rested near the top of the spectral 
features. 

This initial fit was not completely adequate. Contin- 
uum fitting is made more challenging by the high red- 
shift of the quasars and modest spectral resolution of the 
data, which means there is H I opacity at essentially ev- 
ery pixel. Combined with the high signal-to-noise ratio, 
this implies that most of the small scale structure is real 
transmission variation. Based on the structure of the 
quasar continuum and careful scrutiny of each feature, 
the continuum was adjusted-with a stiffer fit in most 
cases (on 80 A scales instead of 50 A scales), or (near the 
Lya or Ly/3 emission lines) a higher order fit. In a few 
regions final manual adjustment had to be made to the 
spline fit, where individual points were connected with 
straight lines on a scale of 20 A. The result is a continuum 
that sits at or just below spectral peaks (less-absorbed 
regions) and is low-order enough not to follow individual 
absorber features or fluctuations due to noise. 

We also used ANIMALS to fit continua to our SPH 
extractions to analyze the reliability of our continuum 
fitting techniques for different line densities and under- 
lying opacity distributions as a function of redshift. The 
results are given in ijS.'Sl but in > j4.3l we discuss minor dif- 
ferences between fitting the simulations and the observed 
data. 

4.2. Line Fitting 

ANIMALS fits Gaussian profiles to individual ab- 
sorbers in regions where the flux is below the contin- 
uum fit. The line fitting algorithm allows the cen- 
tral wavelength, amplitude and FWHM of each line 
to be variables. While no limit is set to the fit- 
ted equivalent width (Wobs), this analysis is based on 
the assumption that individual features are unresolved, 
since the spectral resolution (~220 km s~^) is well be- 
yond the upper bound of the distribution of absorber 
Doppler parameters, as measured by echelle observations 
(|Kim. Cristiani. fc D'Odoricoll200il i. At the redshifts of 
this experiment, 80% of the absorbers will have Doppler 
parameters in the range 20-50 km s~^, contributing a 
negligible broadening in quadrature to the instrumental 
profile. The damped features described in [J5]are excised 
from the spectra during line fitting since ANIMALS is 
not designed to fit the profiles of high column density 
absorbers. 



The process of line selection in ANIMALS focuses 
on maximizing real features and reducing contamination 
from false signals— caused by noise or vari ation in the 
true continuum, which is unknown a priori (jPetrv et al.l 
119981 ). Applying the software to PC 1643-h4631A and B 
yields 240 and 234 unresolved fitted lines respectively. A 
list of all the detected lines is found in Tables [1] and [51 
where each line is characterized by its central wavelength, 
observed EW {Wots), xl (based on the quality of the 
fit), Sfit and Sdet (significances to be discussed in H6.1^ . 
and special notes. Identification of the Lya sample is 
explained in %.2\ resulting in 222 A lines and 211 B 
lines that are used for the analysis. We also perform 
line fitting of SPH simulation spectra, whose details are 
discussed in ^8.1\ 

4.3. Spectral Degradation 

For a fair comparison with observations, the spectra 
extracted from the SPH simulation must be degraded 
to the instrumental resolution and observed SNR of the 
observations. To degrade, the spectra were convolved 
with a Gaussian line spread function, resampled to the 
correct dispersion (0.85A per pixel) using spline interpo- 
lation, and Gaussian noise was added to match the SNR 
as a function of wavelength given by the data. 

Since the simulation extractions each span only 300A 
(corresponding to a 100 h^Q Mpc box), automatic con- 
tinuum fitting can be problematic near the boundary. 
Although periodic boundary conditions in the simula- 
tion prevent any discontinuity in transmission at the box 
edge, an "unlucky" anisotropy in opacity, or region of 
heavy absorption, can skew or tilt the continuum fit. To 
avoid large transmission deviations in the continuum fit, 
the simulated spectra were replicated three times using 
a different noise seed and joined end-to-end. Similar to 
the continuum fits for the data, ANIMALS was used 
to automatically fit continua to the degraded simulation 
extractions using a cubic spline fitting routine. The fits 
used 170-255A smoothing, and the middle third of each 
fit was adopted as the continuum. 

5. CHARACTERISTSCS OF THE SPECTRA 
5.1. Spectral Features 

The quasar spectra in Figure [T] show Lya emission 
peaks at wavelength 5838A for A and 5880A for B. 
An expanded normalized flux plot of the spectra is 
shown in Figure [21 These Lya wavelengths are red- 
ward of the values anticipat ed from the published red- 
shifts ([Schneider et aLlligigih at 5823A and 5872A, but 
their peaks are possibly shifted due to heavy absorption 
by strong intervening damped Lyman absorbers (DLAs) 
just blueward of Lya emission. 

Both spectra exhibit strong, independent damped Ly- 
man absorbers just blueward of Lya emission at 5808A in 
spectrum A and at ^^5859 A in B. The DLAs have separa- 
tions of 770 km and 665 km s~^ from their respective 
quasars (A and B) and are thus associated systems. The 
absorption in spectrum B is bimodal and consists of two 
different absorbers: one at z = 3.827 which is intrinsic 
to the quasar with strong N V absorption, and another 
at z — 3.818 (at 5859A). We claim that two independent 
absorbers, at z = 3.783 (5815A) in A and at z — 3.818 
(5859A) in B, are DLAs. The occurrence of a single close 
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TABLE 1 

Absorption Lines for PC1634+4631A 



Line 


Ac 




xl 


Sfit 


Sdet 


^abs 


Notes 




(A) 


(A) 










(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 



1 4383.28 ±0.10 2.89 ±0.13 6.07 18.40 21.42 2.6057 

2 4390.95 ±0.21 2.12 ±0.19 1.69 14.45 11.28 2.6120 

3 4394.30 ±0.61 0.80 ±0.19 1.69 5.61 4.21 2.6147 

4 4399.17 ±0.21 1.52 ±0.13 1.69 10.94 11.50 2.6187 

240 6021.08 ±0.52 0.31 ± 0.07 2.73 4.45 4.40 



Note. — For each absorption feature listed in col. (1), col. (2) gives the 
central wavelength in A, and col. (3) gives the equivalent width in A. The 
reduced listed in col. (4), and if this value is identical to that of an 

adjacent line, it indicates that the lines were fitted simultaneously. Col. (5) 
gives the significance of the line defined as Wobs/ ^ fit, where ufn is the error 
in the equivalent width, and col. (6) gives the significance of the line defined 
as Wobs/'^det, where crdet is the detection limit of the data at Ac (see text 
for further details). Cols. (7) and (8) contain the redshift of the absorber 
associated with its identification. The Notes labels are as follows: (a) lines 
included in the Lyo sample, (b) lines associated with the opaque system at 
z = 3.777 that are also labeled by their associated identification, (c) lines 
redward of Lya emission which are clearly excluded from the Ly« sample. 
[The complete version of this table may be found in the electronic edition of 
the journal] 



TABLE 2 

Absorption Lines for PC1634+4631B 



Line 


Ac 
(A) 


W,bs 

(A) 




Sfit 


Sdet 


^abs 


Notes 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 



1 4428.96 ±0.38 2.24 ± 0.39 

2 4438.54 ±0.35 2.23 ± 0.33 

3 4443.19 ±0.30 2.99 ± 0.34 

4 4447.21 ± 0.29 2.80 ± 0.33 

234 6001.46 ±0.31 1.25 ±0.20 



0.38 


5.45 


5.77 


2.6432 


a 


0.30 


6.66 


6.74 


2.6511 


a 


0.30 


9.17 


8.88 


2.6549 


a 


0.30 


8.80 


8.49 


2.6582 


a 


0.36 


13.30 


6.27 




c 



Note. — Same as Table 1, but for quasar PC1643+4631B. 



associated DLA with a quasar is quite rare (~2%), so 
the presence of two of these f eatures in a pair with fairly 
large separation is significant (|Hennawi "etan[2006l) . The 
DLAs are too widely separated from each other (by 2500 
km s""'^ or Az = 0.012) to be considered part of the same 
cosmic structure, showing a velocity separation roughly 
the same as that between the quasars. These absorbers, 
and the quasars themselves, may probe a much larger 
overdense region at 3.8. 

In spectrum A, we see another strong damped feature 
from 5005-5060A . This region is blocked out of Lya ab- 
sorber analysis. In spectrum B, there is a somewhat 
weaker system ranging from 5 145-5 160 A. Both spectra 
show Lyman limit systems, at 43 70 A in A and at 4409 A 
in B. Redward of Lya there is very little wavelength cov- 
erage, but a strong N V doublet appears in spectrum 
B at redshift 3.82. No lines redward of Lya could be 
identified. 

5.2. Redshift Estimation 

The most recent es timations of the pair's redshift, from 
[Schneider et"al] (fl991l) . is z =3.790 ± 0.004 for A and 
z =3.831 ± 0.005 for B. Since neither spectrum presented 



here has significant redward coverage, any re-estimation 
of the redshifts must be done with Lya emission, Ly/3 
emission, and the corresponding Lyman limit. The spec- 
trum for quasar A is too irregular to reliably pick out Ly/3 
emission, the peak of Lya emission is slightly shifted as 
described previously, and the Lyman limit (while agree- 
ing with the published redshift) cannot give a very precise 
redshift estimate. In spectrum B, Ly/3 emission is well- 
traced by the continuum fit, but the fitted peak is located 
40A redward of the expected Ly/3 emission peak (using 
the published redshift z = 3.831 ± 0.005). While this off- 
set might be attributed to a faulty wavelength solution, 
we found that both the Lya and the Lyman limit agree 
with the published redshift, and the 5575A sky line af- 
firms that there is no zero point shift in the wavelength 
solution. The deficit blueward of the Ly/3 peak is at- 
tributed to absorption associated with the DLA system 
seen just blueward of Lya emission. This, coupled with 
OVI emission at 1036A, creates the impression of the 
shifted Ly/3 peak. Without any sharp and non-absorbed 
emission peaks, we confirm but cannot improve on the 
published redshifts. 
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Fig. 2. — The normalized flux (relative to the continuum fit) 
of the spectra of quasars PC 1643+4631A (above) and B (below). 
Fitted absorption lines from the Lya sample are numbered (every 
third line) above the spectra and are traced by the dashed line 
overlayed on the spectra. For wavelengths below 4370A in spec- 
trum A and below 4400 in spectrum B, the flux is nearly zero due 
to Lyman limit absorption, so the normalized flux in these regions 
has been set to zero. 



6. CHARACTERISTICS OF THE ABSORBERS 

The line-fitting methodology is taxed at high redshift 
because increased opacity significantly affects the relia- 
bility of line fitting. High opacity can cause a substantial 
underestimation of continuum flux and thus underesti- 
mation of line density and equivalent width. At high 
redshift, line densities are high and blending is a severe 
problem. We discuss these effects quantitatively with 
SPH simulation extractions in H8.3I Below we describe 
the characteristics of absorbers, define the Lya sample, 
discuss contamination from metals and higher order Ly- 
man lines, and calculate line densities. 

6.1. Line Significances 

ANIMALS provides two independent measures of ab- 
sorption line identification reliability. The detection sig- 
nificance, Sdet — Wobsl c^det, rclatcs the strength of each 
line to the detection limit of the data at the correspond- 
ing wavelength. Here, adet is given by the convolution 
of the instrumental line spread function with the Icr flux 
error array, and, as before, Wobs is the fitted equivalent 
width. How well a given line is fit is quantified by the fit- 
ting significance, Sfn — Wobs/ (^WabB^ which is simply the 
ratio of the fitted equivalent width and the uncertainty 
in that mea s ureme nt. For a more detailed discussion, see 
iPetrv etldl ()2006[ ). Since it is possible for a hue to be 
clearly detected but fit poorly (e.g., a blended hue) and 
a feature generated by noise to be reasonably fit with a 
Gaussian, these significance parameters are used in tan- 
dem to assess absorption line reliability. Lines with both 
Sdet < 5 and Sfu < 2 were excluded from the Lya sam- 
ple. These lower limits were selected as a compromise 
between contamination by spurious lines and exclusion 
of legitimate ones. In a direct comparison of line identifi- 
cations for moderate resolution spectra and echelle spec- 
troscopy of the same targets, false positive and genuine 
line loss yields were found to be approximately 8% and 
6%, respectively (Marble & Impey, in preparation). Due 
to high SNR, there are very few low significance lines in 
our sample; only one line at 5114A in spectrum B (which 
is blended with the wing of another line) was rejected us- 
ing these criteria (Figure [3]) . 

6.2. Lya Absorber Sample 

As a prelude to analysis, we must ensure that the sam- 
ple of absorbers represents only Lya lines in the IGM. 
Damped systems appear in both spectra just blueward 
of Lya emission. The redshifts of these features are 
measured as z = 3.778 ± 0.003 for A and 3.818 ± 0.003 
for B. We performed a search for metal lines associated 
with these systems which produced statistically insignif- 
icant results. To declare a match, the wavelength of a 
"matched" line had to be within 2a\ of the redshifted 
metal line, corresponding to the redshift of the DLA. 
With this procedure, five lines were matched to the DLA 
in A (Ly/3 , Ly7 , LyS , N II, Si III), and eight lines 
were matched to the DLA in B {Ly/3 , Ly7 , Lyi5 , Lye 
, Ly( , Ly77 , Ly^ , C II). Since the line density is so 
high at high redshift, we tested the significance of our 
matches by performing an offset experiment in which we 
shifted the spectrum and matched metals at each itera- 
tion. This was done 400 times, from a 100 A offset red- 
ward to lOOA offset blueward using a O.SA stepsize (the 
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Fig. 3. — Fitted significance vs. detection significance for A and 
B absorbers (76 highly significant lines from A and 9 from B lie 
outside of the plot boundaries). Base d on previously determined 
significance parameters discussed in i|6.1l we exclude lines with 
Sfit < 2 and Sfjet < 5 from the Lyo sample, only containing one 
line in either spectra: a blended line at 5114A in B. 



stepsize is larger than the wavelength error for a typical 
line). The average number of random matches was 2.1 ± 
1.4 in A and 3.2 ± 1.7 in B. Since the number of metal 
matches were within ~ 2a of the average number of ran- 
dom matches, we do not claim the identifications to be 
significant. However, we contend that we can identify 
the higher order Lyman lines in A and B up to hyS with 
good reliability. The same offset experiment performed 
for higher order Lyman lines reveals a 2a statistical ex- 
cess at zero offset and the strengths of the lines in rough 
agreement with a predicted line strength. These lines are 
removed from the Lya sample and marked as such in the 
Tables. 

To determine the impact of the proximity effect 
(jScott et al.ll2000l ) on absorbers close to Lya emission, 
we considered the luminosity of the quasars. Both A 
and B are relatively faint, with apparent magnitudes 
Ba = 20.0 ± 0.03 and Bb = 20.7 ± 0.03 at redshift 
'^3.8, suggesting the extent of the proximity effect is min- 
imal. The intervening DLAs further limit the extent of 
the proximity effect, to about 1000km s~^, which does 
not include any lines in our lists (save the DLAs them- 
selves). Another consideration is the damped systems 
that in each case sit close (~800 km s~^) to the emission 
redshift. With a high column density, these features fur- 
ther limit the extent of the proposed Stromgen spheres. 
These coupled effects lead us to keep all lines fitted blue- 
ward of the DLA systems at z = 3.778 and z = 3.818, 
which is out to '^1700 km s~^. 
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Fig. 4. — Mean transmission (averaged over ± 150A) vs. wave- 
length, showing little change blueward of Ly/3 emission relative to 
redward (upper panel). The vertical lines mark Ly/3 emission for 
each quasar. The lower panel shows the rolling average of the line 
density in a 150A bin which is independent of equivalent width 
(since the distribution in equivalent width does not significantly 
evolve with redshift). Since we do not detect extra absorption 
from metals or higher order Lyman lines and only observe slight 
changes in transmission and line density, we carry forward into the 
analysis both (a) the entire region from the Lyman limit to Lyo 
emission and (b) the Lyo forest alone. 



6.3. The Lyf3 "Forest" 

To test for Ly/3 contamination blueward of the Ly/3 
emission line we performed an offset experiment. The 
baseline, or zero offset, measurement involved assuming 
each absorber between the wavelengths of Lya and Ly/3 
emission is a Lya absorber, and predicting the wave- 
length positions of Ly/3 from each Lya line. Then we 
shifted the wavelength centers of lines (blueward of Ly/3 
emission) by offset increments of 5 A, and measured the 
number of Lya —Ly/3 pairs. The offsets ranged over ± 
200A . The criterion of accepting a pairing had to be that 
the observed line wavelength was within 2 ax of its pre- 
dicted wavelength, which was on the order of 0.4A (but 
determined uniquely from the corresponding Lya line). 
Over the entire range of offsets, the average number of 
matched pairs was 8.5 ± 2.6 for A and 15.4 ± 4.0 for B. 
At zero wavelength offset, the data show 10 pairs in A 
and 16 pairs in B, so there is no statistical detection of 
Lya —Ly/3 pairs. 
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As further tests of whether we could use the fuU re- 
gion down to the Lyman Umit (z ~ 2.65) to define a 
Lya absorber sample we looked at mean transmission 
and line density. Figure [4] shows that there is no signifi- 
cant change in mean transmission when passing from the 
pure Lya forest to the region where Ly/3 lines are also 
present, the "Ly/3 forest." The line density (calculated 
independent of equivalent width) is also shown in Figure 
13] and is constant across the entire range of data, sup- 
porting the assumption that most lines are Lya unless 
otherwise noted. This line density comparison is valid 
since the observed distribution of equivalent widths does 
not change as a function of redshift. Our discrete ab- 
sorber analysis includes the Ly/? region, while our flux 
statistics studies and cross-correlation calculations pro- 
ceed through analysis with and without exclusion of the 
Ly/3 forest, which will be discussed more thoroughly in 

m 

6.4. Statistical Absorber Properties 

The line culling process described in the preceding sec- 
tions (removing lines from metal contamination, proxim- 
ity effect, poor fitted or detection significance, or damped 
system lines), reduces the full line lists in Tables [Hand 
[2] to the Lya sample: 222 lines in A and 211 lines in 
B. In accordance with previous studies, we express the 
number of absorption lines per unit redshift per unit rest 
equivalent width as 



dzdW W* 



(1 



W 



(1) 



where 7 and W * are determined by m aximum likelihood 
estimation as in [Murdoch et al] (|1986f ) with code written 
by A. Dobrzycki. We can also express the number of lines 
per unit redshift above a fixed equivalent width limit 
(often taken in the literature as 0.32A or O.IGA ), as 



dN 
dz 



Aoii + zy 



(2) 



The most appropriate hne densi t y com parison from the 
literature comes from iBechtoldl (|1994l ). who presented 
a moderate resolution, high redshift (z > 2.6) sample 
(their Table 4, sample 21a for Wthr = 0.32A and sam- 
ple 22a for W thr = 0.16A), a nd from iKim et all (|1997f ) 
(likewise from lHu et al.|[l995l ). who used high resolution 
data at all redshifts (with Wthr = 0.32 A). We adopt 
the equivalent width threshold value of 0.32A since it 
is above our limiting equivalent widths for A and B at 
nearly all wavelengths. Splitting the data into five red- 
shift bins each centered at redshifts 2.75, 2.99, 3.26, 3.47 
and 3.67 (excluding data from the damped feature in A 
surrounding 5005A-5060A) we present dN/dz in Figure 
[5l The difference between the two earlier studies is read- 
ily understood as an effect of line blending and limited 
resolution. Kim et al. did simulations to show that their 
completeness to 0.32 A lines at z ^ 3 was about 90%. 
Bechtold only was able to recover about half of the lines 
at this strength, and our data is intermediate in com- 
pleteness. 

We see broad agreement with the two earlier studies 
except in the lowest one (for B) or two (for A) redshift 
bins. Since the Ly/3 forest begins in the middle of the 
second lowest redshift bin, around z ~ 3, this casts some 
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Fig. 5. — Lya absorber density as a function of re dshift for 
lines w ith Wp > 0.32A , comp ared to two previoius studies fBechtoldl 
1IT99I) . and|K im et all II1997I) , both with lower limits on equivalent 
width of 0.32A. The higher line density of the latter study is due 
to better signal-to-noise and resolution. Line density is calculated 
in five bins (avoiding the strong DLA in spectrum A at ~5030A) 
centered on redshifts 2.75, 2.99, 3.26, 3.47 and 3.67. No evolution 
of strong lines is apparent since we underestimate the continuum 
and thus underestimate line counts and line strengths at higher 
redshifts. 

doubt on using this region in the Lya analysis. From 
this direct comparison of dN/dz with literature values 
of high resolution data, we estimate the number of ab- 
sorbers that are averaged together due to the spectral 
resolution and SNR. As a function of redshift (split into 
the five redshift bins earlier described at z=2.75, 2.99, 
3.26, 3.47, and 3.67), the number of hypothetical Lya 
absorbers divided by the observed number of absorbers 
is 0.67, 0.96, 1.36, 1.51, and 1.50 (i.e. blending of ~ 3 
true Lya absorbers into 2 fitted lines at the highest 
redshifts). 

While the additional signal to noise in spectrum A 
might lead one to believe that more absorbers should 
be fit than in spectrum B, the resolution and high line 
density of this high redshift data washes out the depen- 
dence of line count on signal to noise directly. We fit 
all lines with the assumption that they are unresolved, 
and had line profiles equal to the resolution of our ob- 
servations (3.6A). The separation between adjacent fit- 
ted lines is on order 2-4 times this wavelength interval. 
Since the separation between lines and width of lines is 
comparable, and lines are restricted to have a minimal 
seperation of at least one sampling unit (0.85A), it can 
have a enormous effect on the effective line density of the 
data. Weaker features are washed out of the experiment 
due to the resolution limit of line fitting. So the addi- 
tional lines that one might expect to exist in spectrum 
A were likely neglected due to blending. When fitting 
lines to the simulation spectra (as part of our compari- 
son with simulations in §8.ip , we find that the number of 
lines fit to either sightline (A or B) are comperable, and 
S/N does not have an effect on the lists from resolution 
limitations and a very high line density. 

The distribution in rest EW for both Lya forests is 
shown in Figure [5] with exponentials overplotted that 
only fit data above the Wthr = 0.32A limit. Above this 
limit, the distribution is well-described by an exponen- 
tial. At high redshift (z > 3.4) the continuum fit from 
HA.1\ is suspected to significantly underestimate the true 
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Fig. 6. — The distribution in rest equivalent widths for Lya 
absorbers from both sightlines. Above 0.32A, the distributions 
are moderately well-described by exponentials. The exponential 
fits (dashed Unes) are described by Na{Wo) = 48. 42exp(-2. 291^0) 
for A and NpiWo) = 50.91exp{-2.42iyo) for B. The distribution 
shapes are similar, and when split into liigher and lower redshift 
bins show no sign of evolution. 

continuum level (discussed quantitatively in i i8.3p . which 
means that the equivalent widths of high redshift ab- 
sorbers are underestimated. We observe no evolution of 
the distribution of equivalent widths with redshift. The 
effect on line count and line strength from continuum 
underestimation is complex, based both on the omission 
of weaker absorbers and the underestimation of equiva- 
lent widths from stronger features. This systematic bias 
in the continuum applies equally to both sightlines and 
does not affect measures of the transverse coherence be- 
tween discrete absorbers (since the effect is equal for both 
sightlines), but it will significantly impact measures of 
transverse correlation when considered in flux statistics 
tests, which is why we quantify the underestimation in 

7. ABSORBER COINCIDENCES 

7.1. Symmetric Matching 

Symmetric pairing of lines across the spectra is done 
via nearest neighbor matching in wavelength space. A 
pair is considered symmetric if both lines are each oth- 
ers' nearest neighbor, i.e. the line from A is the nearest 
feature in wavelength to the line in B and vice versa. 
This matching method is the simplest way to measure 
coherence with discrete measures of opacity like absorp- 
tion lines. We find 152 matches across the sightlines; the 
specifics of the absorber properties and separations are 
listed in Table [31 Next we explore the properties of the 
symmetric matches in this dataset, the scale on which we 
expect matches to be made, and whether or not coher- 
ence can be inferred from this high redshift data relative 
to Monte Carlo simulations. 

7.2. Characteristic Absorber Separations 

Symmetric pair matches may just be chance pairings. 
The mean separation between matched pairs is 1.4A or 
~84 km s~^, as seen as a peak in Figure [71 In contrast, 
the average separation between adjacent lines in each in- 
dividual spectrum (a measure of line density) is 6.1 ± 
3.6A (-370 km s'^) in A and 6.8 ± 4.2A (-410 km s^^) 
in B. If the requirement of symmetry (A matches to B 



and B matches to A) is dropped in the matching pro- 
cess, the mean A A increases as expected (to 3.4±6.5A). 
To indicate the physical scale on which lines are being 
matched, we calculate the velocity scale of differential 
Hubble flow. Using an angular separation of 198", a typ- 
ical radial line separation translates to AAo = 4.5A, or 
-270 km s"^ 

7.3. Monte Carlo Experiment 

We perform a Monte Carlo experiment to test the sig- 
nificance of symmetric line matches, by making matches 
across sightlines as a way to detect coherence (above the 
chance matching resulting from randomly placed lines). 
This test is not limited or compromised by low reso- 
lution because lines are typically centroided to 30 km 
s^^(— O.sA), which is —7 times smaller than our 220 km 
s~i(3.6A) resolution pinshaw et all 119981 . , using the 
same method for HST data of similar resolution) . Since 
the line velocity error (30 km s~^) is much less than the 
mean velocity splitting of matched pairs (84 km s^^), 
the measurement error does not impact this line match- 
ing coherence measure. With 1000 realizations, the ex- 
periment recreates the two sightlines, sampled to mimic 
the line density and equivalent width distribution of the 
observed spectra, and computes symmetric matches be- 
tween the two randomly drawn sets of absorbers. The 
entire range of the data is used, including the Ly/3 forest. 
This adds needed line statistics to the Lya sample, and 
since line fitting is somewhat immune from varying mean 
flux and continuum fits (which exhibit strange behavior 
in the Ly/3 region as can be seen by excessive opacity in 
Figure [H particularly in A) there is no concern that line 
matches in the Ly/3 region are not meaningful. The only 
risk in including the Ly/3 region is double counting Lya 
and their Ly/3 counterpart matches, which will also trace 
coherence in the IGM. This double counting is unlikely 
since we found very little evidence that Ly/3 contamina- 
tion is significant, and at most, would only change the 
resuh by -10%. 

To create the simulated Monte Carlo samples, we 
use the limiting equivalent width (or detection limit) 
as a function of wavelength, which hovers about 0.24A. 
Damped regions are removed from the simulated spec- 
tra as from the data: 5005-5060A in A and 5145-5160A 
in B. We overplot the results of this Monte Carlo match- 
ing experiment on our observed matches in Figure[71 The 
data show a — 4(t coherence signal above the Monte Carlo 
random matching result in the 50-100 km s^^ and 100- 
150 km s^^ bins (pair excesses are 3.9a and 3.7a respec- 
tively). The experiment was repeated by splitting the 
sample into strong and weak lines (split at 0.5A, roughly 
half the sample) , but it did not show that coherence was 
more prevelent in either strong or weak absorbers. The 
first bin (0-50km s~^) is depopulated due to line blend- 
ing; few symmetric matches on small velocity splitting 
scales would be counted if blended features containing 
two absorbers correspond to single absorbers in the op- 
posite sightline. Therefore, we do not claim that the 
offset signal (peak between 50 km s~^ < Av < 150 km 
s^^) is due to inherent shear in the IGM. 

8. COMPARISON WITH THE SIMULATIONS 
8.1. Absorber Coincidences from Simulations 
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TABLE 3 
Symmetrically Matched Lva Lines 



Match Number 
(1) 


(A) 

(2) 


As 

(A) 

(3) 


|AA| 

(A) 

(4) 


|A^1 
(km s ^) 
(5) 


Wo,A 

(A) 
(6) 


Wq,b 

(A) 
(7) 


lAWol 
(A) 

(8) 


1 


4428.00 


4428.95 


0.95 


65.0 


0.08 


0.61 


0.52 


2 


4439.29 


4438.54 


0.75 


50.6 


0.67 


0.60 


0.06 


3 


4445.02 


4443.18 


1.83 


123.5 


0.93 


0.81 


0.11 


4 


4449.20 


4447.20 


2.00 


134.8 


0.38 


0.76 


0.37 


152 


5781.83 


5781.31 


0.52 


27.4 


0.25 


0.37 


0.11 



Note. — Each symmetrically matched pair is labeled 1-152 in col. (1), with col. 
(2) and (3) giving the wavelengths of the matched absorbers in spectrum A and B 
respectively. Column (4) gives the wavelength separation of the pair, and col. (5) 
gives the velocity difference in km s~^. Columns (6) and (7) give the rest equivalent 
widths of the lines (for A and B respectively) and col. (8) gives the difference in 
rest equivalent widths. [The complete version of this table may be found in the 
electronic edition of the Journal] 
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Av (km s~^) 

Fig. 7. — The velocity splitting of 152 symmetrically matched pairs. The results of random matching Monte Carlo experiments are 
overplotted with 90%, 95%, and 99% confidence intervals. The first data bin, from 0-50 km s^^, is significantly limited by the instrumental 
resolution and thus contains fewer matches than high resolution echelle spectra would exhibit. The 50-100 km s~^ and 100-150 km s~^ bins 
show a 3-4(T coherence signal above the Monte Carlo expectation. Wider velocity splitting bins with Av > 150 km follow expectation 
from random pairings. 
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Fig. 8. — The velocity splitting distribution of absorbers in 
the spectra extracted from the simulations. The left panel shows 
the distribution as a function of realization redshift. For reference, 
we have overplotted the random absorber result from Monte Carlo 
tests (as shown in Figure (Tjl, and we see that the simulations show 
a weak but detectable coherence signal above random chance pair- 
ings. When properly weighted over the redshift interval of our data 
(2.6 < z < 3.8), we show the simulations alongside the data result 
and the random test result in the right panel. 

To further interpret our ~4(t detection of coherence 
from the previous section, we perform a similar symmet- 
ric pair matching on absorbers in the spectra extracted 
from simulations (as described in fJS])- Line fitting was 
done with the same ANIMALS utility that was used for 
the data, this time applied to the degraded extractions 
from simulations designed to match the data. Figure [5] 
(left panel) shows the velocity splitting distribution at 
each redshift for symmetrically matched pairs in asso- 
ciated A and B simulated spectra (with the MC ran- 
domization result overplotted for reference). At higher 
redshift, greater line densities generate a steeper peak at 
zero splitting as expected, while lower redshifts give shal- 
lower distributions. To make a fair comparison with the 
data, we interpolate the distribution shape that simula- 
tions would produce for a spectrum continuously sam- 
pling redshifts 2.6 < z < 3.8. This is shown alongside 
the randomized result and the data in the right panel 
of Figure [H This not only reaffirms that we can detect 
weak coherence in the lya forest at high redshifts, but 
that the simulations agree with our data in showing a 
signal of roughly the same strength. 

The one area of disagreement between data and de- 
graded extractions from the simulations is in the first 
velocity offset bin (0-50km s^^). This can be explained 
simply as a depopulation of the smallest velocity offset 
bin in the data due to line blending (i.e. the discrep- 
ancy is non-physical). Although this happens to some 
degree in the spectra extracted from simulations, the set- 
tings in ANIMALS used to generate line lists from the 
simulations allow lines to be fitted with smaller wave- 
length separations (below the 0.85A restriction applied 
to the data) since the spectra themselves are so short 
(only 300A , rather than the data that span ~1500A). 
The lowest bin, where the turn-down occurs in the data, 
correponds to splittings less than 40% of the instrumen- 
tal resultion. At these high redshifts, where blending is 
severe, the transverse matching result will be sensitive 
to the exact criterion used for minimum line separation 
in each sightline. The slight depopulation of simulation 
matched pairs relative to data at high velocity splitting 
is a consequence of the same effect. To first order, this 
preliminary test for high-z coherence is successful and 
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Fig. 9. — The transmission distributions of 3000 spectra ex- 
tracted from SPH simulations at four discrete redshifts. The first 
column shows the distributions of non-degraded simulation spec- 
tra, with no noise added and very high resolution, which character- 
izes the intrinsic cosmic evolution of the IGM in the simulations. 
The second column shows the distributions for simulations after 
degrading to the signal-to-noise and instrumental resolution of the 
spectra, but without rescaling the mean flux. The distributions 
of A and B do not differ despite a difference in SNR, so they are 
combined. The effect of resolution is particularly visible at high 
redshift. The third and fourth columns represent the same distribu- 
tions from column 2 after being rcscaled by mean fl ux according to 
P93 llPress. Rybi cki. IFSchne ider 1993) and K05 l IKirkman et al.l 
120051) respectively. The vertical lines in the three last columns 
represent the ave rage transmission value of the fitted continua as 
described in i|8.3l 

it encourages additional testing of the simulations that 
can be done in the future more effectively using higher 
quality data. 

8.2. Evolution in the Transmission Distribution 

The spectra extracted from the G6 simulation (Sj3]) 
show significant changes in opacity distributions over the 
redshift range of our data. At their initial high resolution 
and infinite SNR, the transmission distributions show 
substantial absorption at higher redshift while very little 
at low redshift, the anticipated strong cosmic evolution 
from z = 2.6 to z = 3.8 (Lyman limit to Lya emission 
in our data). The transmission distributions in the first 
column of Figure [5] combine the statistics of both A and 
B simulated sightlines, non-degraded and not rescaled to 
mean flux via either P93 or K05. At z = 2.5, the lowest 
redshift of simulation extractions, there is a strong peak 
at high transmission indicating little absorption relative 
to the true continua; the small tail at zero transmission 
indicates the number of pixels in damped systems. At 
higher redshifts, z = 3.0 and 3.5, there is a clear shift 
in the distribution towards lower transmission, and the z 
= 4.0 distribution suggests opacity at nearly every pixel. 
The number of opaque pixels increases tenfold from z = 
2.5 to z = 4.0 from 3.1% to 20.0% of all pixels. 

After degrading the raw extracted spectra to the 
signal-to-noise and resolution of the data, the second col- 
umn of Figure [H] shows changes in the overall transmis- 
sion distribution at each redshift. There was no observed 
SNR effect on transmission distributions shape, so the 
statistics from both fabricated sightlines were combined 
and weighted by relative SNR. The effects are as ex- 
pected; the peaks have broadened, there are fewer pixels 
with zero transmission, and since noise has been added, 



Lyman-a Forest of PC 1643+4631A,B 



13 



a small fraction of pixels have transmission greater than 
one. With increasing redshift, the distribution shape 
evolves from a high transmission peak to a nearly lin- 
ear fall off with higher transmission at z — 4.0. In the 
figure, the dotted line indicates zero opacity (the true 
continuum), and the dashed line indicates the average 
level of the continuum fit after the simulation extractions 
are degraded. 

If the degraded simulations are altered to match the 
mean flux estimates given by P93 and K05 the transmis- 
sion distributions change shape, as shown in the third 
and fourth columns of Figure [H The P93 rescaled sim- 
ulations have very similar shapes to the raw degraded 
simulations, while the K05 rescaled simulations do not 
show as much dramatic evolution at high redshift. This 
comparison shows the importance of mean transmission 
in interpreting data where it is not known a priori. 

8.3. Underestimation in Continuum Flux 

From the simulation, we infer a systematic error in 
AN IM ALS continuum fitting as a function of redshift. 
When degraded to the resolution of our observations, the 
smoothing of transmission on small velocity scales shifts 
the peak transmission values lower, and so depresses any 
algorithmic fit to the continuum. This effect becomes 
larger at higher redshifts as the mean opacity in the IGM 
increases. The vertical dashed lines on columns 2-4 of 
Figure [9] represent the transmission values of the fitted 
continuum (with respect to the true continuum level). 
When we compare the data's transmission distributions 
to those of the simulation, we use the estimate of the con- 
tinuum fit transmission to rescale the data's flux, thus 
correcting the underestimation and making the compar- 
ison between data and simulation fair. The flux decre- 
ment of the continuum may be modeled as a parabolic 
function of (1 -t- z): 

/rfcont(l + ^) = 0.697- 0.419(1 + + 0.065(1 + (3) 
with variance a^j^ — 0.002, and valid for 2.5 < z < 4.0. A 
quadratic fit is empirically chosen because a linear fit is 
an inappropriate choice to model such redshift evolution. 
The rescaling of degraded spectra to either P93 or K05 
does not have any additional impact on the underesti- 
mation of continuum flux. Pixels that have very high or 
low transmission are not affected by rescaling the mean 
flux as much as pixels with intermediate absorption, and 
since the continuum fitting is based on the highest trans- 
mission pixels, the rescaling effect will be minimal, par- 
ticularly at low redshift. At high redshift the size and 
uncertainty in the continuum underestimation grow as 
would the effect of rescaling by mean flux, but it is clear 
that the systematic error in the fit dominates the contin- 
uum flux value over the smaller variations between raw, 
P93, and K05 mean flux rescalings. 

This systematic effect leads to an underestimation of 
line density as well as equivalent width. At high red- 
shift, where the effect is most severe, only the strongest 
and highest column density features will be fit via 
ANIMALS methods, while the much weaker lines in 
blended regions are not fit. The size of this effect can- 
not be properly measured since line fitting relative to 
the true continua is not a unique process when there is 
opacity at nearly every pixel. EchcUc spectra can miti- 
gate this problem, since moderately strong features will 



■ z = 2.75 




\ 


z = 2.99 / 




:/1 


\ 


. z = 3.47 




v 






I 


Data 

Kirkman 05 

Press, Rybick 

&c Schneider 93 

Transmission 
Distributions 



0.0 0.2 OA O.S O.S 1.0 1.2 0.2 0.4 0.6 0.8 l.O 1.2 



Fig. 10. — Transmission distributions of combined data for both 
quasars split into five redshift bins centered at 2.75, 2.99, 3.26, 
3.47, and 3.67. The distributions of spectra extracted from simu- 
lations are overplotted at these redshifts by linearly interpolating 
the transmission distribution shapes from the four extracti o n red - 
shifts: 2.5, 3.0, 3.5, and 4.0. IPress. Rvbicki. fc Schrieldeil ||T993|) 
(dashed lines) and Kirkman ct al. (2005) (solid lines) mean flux 
rescalings show quite different distribution shapes, particularly at 
high redshift. The data transmission was rescaled to correct for 
the systematic unde restimation in the AN I M ALS continuum fit, 
as described in i|8.3l 

be fully resolved and reliably measured regardless of the 
true continuum level. 

8.4. Comparisons of Transmission Distributions 

When comparing the observational data to simulations 
we must recall that the SPH extractions represent dis- 
crete epochs (z = 2.5, 3.0, 3.5, and 4.0) while the data 
samples redshift continuously with cosmic evolution su- 
perimposed. We split the spectra into flve bins ~300A 
wide, centered at redshifts 2.75, 2.99, 3.26, 3.47 and 3.67. 
These redshift bins were chosen to avoid problem areas 
in both spectra— the large damped system in quasar A 
(5005-5060 A), all data blueward of the Lyman limit in 
either spectrum (A >4420A), and all data redward of the 
DLAs close to the Lya emission peaks (A <5790A). The 
first two bins, z = 2.75 and 2.99, cover the Ly/3 forest 
while the three subsequent bins, z = 3.26, 3.47 and 3.67, 
cover the Lya forest and are slightly narrower in width 
(~ 280A). 

We linearly interpolate the shapes of the simulations' 
transmission distributions to infer the distribution shape 
at these five intermediate redshifts. We do these proce- 
dures using both P93 and K05 sets of spectra, adopting 
different mean flux values for the simulations. Figure [TOl 
shows the transmission distributions for the data (his- 
togram) and the simulations (P93 scaling is dashed and 
K05 scaling is solid) at the five redshift bins. Since the 
continuum flux has a significant effect on transmission 
(particularly at high redshift), we rescale the normalized 
data flux by a factor representing the mean AN IM ALS 
fitted continuum flux (as a function of redshift). The 
transmission of each pixel in the data is rescaled by this 
continuum flux factor, given by equation [3l which re- 
moves the effect of the systematic underestimation in the 
continuum flt. 

At the lowest redshift (z = 2.75 in Figure [TU)) . the 
data show far more low transmission pixels than antic- 
ipated. The extra absorption relates directly to the ex- 
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Fig. 11. — The distrib ution in cross- c orrelat ion ampli- 
tude for the simulations, with IKirkman et aO I|2005I 1 (top) and 
IPress. Rvbicki. fc Schn"elde3 I I1993I) fbottom) mean flux rescalines. 
The 2000 simulations used for this experiment are split into four 
(uneven) redshift bins to reproduce the cleanest portions of ob- 
served spectra: 2.75 < z < 2.95, 3.19 < z < 3.22, 3.26 < z < 3.58 
and 3.59 < z < 3.75. The first bin shown includes a portion of 
the Ly/3 forest. The final two panels are combinations of the first 
four: 'ALL' means it covers the entire range of our data (all four 
bins) and 'LYA' represents the exclusive Lya forest experiment, 
z > 3.19 (excluding the lowest redshift bin). The dashed vertical 
lines indicate the cross-correlation measurements for the data. 



cess in line density seen in Figure [51 but is too large 
an effect to be attributed directly to Ly/3 absorbers or 
metal absorbers. Since the signal-to-noise of the blue 
end of the data is quite low, the true transmission dis- 
tribution may be modified by a large flux error. The 
remaining four bins from 2.99 to 3.67 show fairly good 
agreement between simulations and data. To test agree- 
ment with the simulations models, both from the Kirk- 
man et al. and Press et al. treatments, we performed 
K-S tests to test goodness of fit. It revealed that signifi- 
cant agreement (>45%) only occured in the three highest 
redshift bins (z — 3.26, 3.47 and 3.67) using the Kirkman 
mean flux scaling simulations treatment (with probabili- 
ties of 48%, 96%, and 63% respectively). The lower red- 
shift bins in the K irkman treatment, and none of the 
IPress. Rvbicki. fc S chneider (1993) tr ansmission distri- 
butions were proper flts, all with probabilities lower than 
3%. 

8.5. Transverse Correlation Comparison 



To compute the cross-correlation between simulation 
sightlines, the extracted spectra are pieced together from 
the z — 3.0 and z = 3.5 realizations to cover the wave- 
length range of the data. Instead of using linear interpo- 
lation to account for evolution at intermediate rcdshifts, 
piecing together extraction spectra (300A in length) from 
the discrete redshift bins (2.5, 3.0, 3.5, 4.0) is a better 
test of simulation and data agreement. Rather than mea- 
suring cross-correlation for each redshift and then inter- 
polating, we meaasure the cross-correlation from spec- 
tra pieced together to model the redshift evolution. The 
spliced simulation spectra run from 4420A (the Lyman 
limit in B) at the blue end to 3000 km s^^ blueward 
of Lya emission in A (5765 A). Portions of the spectra 
are masked due to features in the data: the damped 
system in A from 5005-5060A, an emission feature in 
A from 5142-5148A, and the 5575A sky line. Since in- 
clusion of the Ly/3 forest region may not be warranted 
(see the discussion in §6.41) . we perform cross-correlation 
in segments: from 2.75 < z < 2.95 (the Ly/3 region with 
SNR > 10), 3.19 < z < 3.22 (the Lya forest between 
the damped feature and Lya emission), 3.26 < z < 3.58 
(the Lya forest between Lya emission and the 5577A 
sky line), and 3.59 < z < 3.75. The results are then 
combined into two larger bins: the Lya forest (all bins 
with z > 3.19) and the entire range (including the lowest 
bin, which spans part of the Ly/3 forest). The cross- 
correlations arc computed for both P93 and K05 mean 
flux scalings. The distribution of the cross-correlation 
amplitudes for the simulations is show n in Figure | 11[ 
The to p set of panels shows the results for lKirkman et ahl 
(|2005h mean flux rescalings while the bottom half show 
those forl Press. Rvbicki. fc Schneider! l|1993f ). The cross- 
correlation amplitude is within l-2cr of zero in every case, 
which emphasizes the difflculty of measuring coherence 
at this physical separation and redshift, given sample 
variance. Overall, the experiment shows that the K05 
rescaled simulations are correctly modeling the Lya ab- 
sorbers at high redshifts since the data measurements 
are consistently within ~1(t of the mean cross-correlation 
amplitude. On the other hand, simulations assumin, 
rescal ed flux according to IPress. Rvbicki. fc Schneide 
(|1993f l do not agree with data, with a difference ofj~8a 
over th e Lya regio n. This agreement with fKirkman 'et al.l 
(|2005') rather than Press, Rybicki, fc Schneider (1993) is 
consistent with the earlier result from transmission dis- 
tributions in §8.41 (as seen in Figure [T0|) . 

9. SUMMARY 

This paper has presented new spectroscopic observa- 
tions of the z ~ 3.8, 198" separation quasar pair PC 
1643-f 4631A, B and associated detection of coherence in 
the IGM on scales that have not been previously tested, 
~2.5 h^Q Mpc. The observations cover the full extent of 
the Lya forest range from Lya emission to the Lyman 
limit (with high signal-to-noise and moderate resolution, 
3. 6 A) and provide an excellent opportunity to not only 
measure coherence and large scale structure in the IGM 
at high redshift, but also compare observations with pre- 
dictions from cosmological simulations. 

The Lya absorber sample was defined in two ways: 
using the entire range of data from Lya emission to the 
Lyman limit, or restricting the Lya experiment to the 
region between Lya emission and Ly/3 emission, dubbed 
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the pure Lya region. Metal and higher order Lyman hnes 
were not detected in the Ly/3 forest region due to the high 
hne densities, but since contamination may be present 
and the blue wavelengths have low SNR we treat the 
Ly/3 forest region separately from the Lya forest in flux 
statistics studies. A Monte Carlo random pairing exper- 
iment using the entire range revealed that chance pairs 
account for a significant portion of the velocity splitting 
of symmetric matches, but in addition to that the data 
show a 4(7 excess of pairs relative to random, near zero 
velocity splitting (Aw < 150 km s~^). This shows that 
multiple sightline coherence techniques work using line 
counting when applied to high redshift quasar pairs, and 
produce significant detections of coherence on ~ 2.5 h^Q 
Mpc scales up to z ~ 3.8. 

This data set have provided a unique opportunity to 
test expecations from simulations. The nearest-neighbor 
absorber matching, transmission distribution and trans- 
verse coherence all indicate agreement between simula- 
tion expectations and results from data, which has not 
been reliably tested before at this high redshift and wide 
separation, primarily since quasar pairs of this type are 
rare. The absorber matching experiment shows that the 
simulations show a weak but detectable coherence signal 
at low velocity splitting, with the same 4ct strength as the 
data. Combining sightline statistics, we compared trans- 
mission distributions in five discrete bins and see that the 



data generally agree with the redshift evolution of the 
simulation's t ransmission dist r ibutio n shapes, and agree 
best with the iKirkman et al.l (|2005f ) treatment of mean 
flux rescaling. The cross-correlation experiment finds 
a weak coherence sign al from the simulation s for both 
mean flux decrements (jPress et al.l[l993.: Kirkman et al.l 
I2005D: the data al s o sho w a weak signal when rescaled 
by iKirkman et al.l (|2005D . agreeing with the simulation 
result, but show a strong coherence nieasur ement if 
rescaled by iPress. Rvbicki. &: Schneideil ()1993D . These 
three separate statistics (arbsorber matching, transmis- 
sion distribution shape and cross-correlation amplitude) 
are mutually consistent and have opened a new regime 
in redshift and wide separation on which to compare the 
structure of the IGM, and agreement between data and 
simulations. 
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